Multigrid Q-learning

نویسنده

Charles W. Anderson

چکیده

Reinforcement learning scales poorly when reinforcements are delayed. The problem of propagating information from delayed reinforcements to the states and actions that have an e ect the reinforcement is similar to the problem of propagating information in a discretized boundary value problem. Multigrid methods have been shown to decrease the number of updates required to solve boundary value problems. Here we extend Q-Learning by casting it as a multigrid method and show a reduction in updates required to reach a given error level in the Q-function for a simple, 1-d Markov decision task.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multigrid Algorithms for Temporal Difference Reinforcement Learning

We introduce a class of Multigrid based temporal difference algorithms for reinforcement learning with linear function approximation. Multigrid methods are commonly used to accelerate convergence of iterative numerical computation algorithms. The proposed Multigrid-enhanced TD(λ) algorithms allows to accelerate the convergence of the basic TD(λ) algorithm while keeping essentially the same per-...

متن کامل

Performance-influence models of multigrid methods: A case study on triangular grids

Multigrid methods are among the most efficient algorithms for solving discretized partial differential equations. Typically, a multigrid system offers various configuration options to tune performance for different applications and hardware platforms. However, knowing the best-performing configuration in advance is difficult, because measuring all multigrid-system variants is costly. Instead of...

متن کامل

Multigrid and Multilevel Methods for Nonconforming Rotated Q Elements

In this paper we systematically study multigrid algorithms and multilevel preconditioners for discretizations of second order elliptic problems using nonconforming rotated Q nite elements We rst derive optimal re sults for the W cycle and variable V cycle multigrid algorithms we prove that theW cycle algorithm with a su ciently large number of smoothing steps con verges in the energy norm at a ...

متن کامل

New Convergence Estimates for Multigrid Algorithms

In this paper, new convergence estimates are proved for both symmetric and nonsymmetric multigrid algorithms applied to symmetric positive definite problems. Our theory relates the convergence of multigrid algorithms to a "regularity and approximation" parameter a 6 (0,1] and the number of relaxations m. We show that for the symmetric and nonsymmetric V cycles, the multigrid iteration converges...

متن کامل

A Compact Multigrid Solver for Convection-Diffusion Equations

diffusion equations using a nine-point compact difference scheme. implementation with multigrid, and carry out a Fourier We test the efficiency of the algorithm with various smoothers and smoothing analysis of the Gauss–Seidel operator. In Secintergrid transfer operators. The algorithm displays a grid-indepention 3 we present numerical experiments that demonstrate dent convergence rate and prod...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 1994

Multigrid Q-learning

نویسنده

چکیده

منابع مشابه

Multigrid Algorithms for Temporal Difference Reinforcement Learning

Performance-influence models of multigrid methods: A case study on triangular grids

Multigrid and Multilevel Methods for Nonconforming Rotated Q Elements

New Convergence Estimates for Multigrid Algorithms

A Compact Multigrid Solver for Convection-Diffusion Equations

عنوان ژورنال:

اشتراک گذاری